Algebraic gossip on Arbitrary Networks 



Dinkar Vasudevan and Shrinivas Kudekar 
School of Computer and Communication Sciences, EPFL, Lausanne. 
Email: {dinkar . vasudevan, shrinivas . kudekar}@epf 1 . ch 



On 
O 
O 
(N 



> 



O 

On 
O 



X 



Abstract — Consider a network of nodes where each node 
has a message to communicate to all other nodes. For this 
communication problem, we analyze a gossip based protocol 
where coded messages are exchanged. This problem was studied 
in [4] where a bound to the dissemination time based on the 
spectral properties of the underlying communication graph is 
provided. Our contribution is a uniform bound that holds for 
arbitrary networks. 

I. Introduction 

There are n nodes in a network, each having an information. 
The goal is to disseminate the information at every node to 
every other node in the network. We assume that the network is 
composed of noiseless links and that the nodes are restricted to 
communicating with their neighbours defined by the network. 
Our goal is to characterize the dissemination time for protocols 
based on pairwise communication between nodes. 

The problem formulation is motivated by potential appli- 
cations in sensor networks and databases and is not new. 
Deb et. al [3] considered the problem where k messages are 
disseminated in a network of N nodes. However, their analysis 
is restricted to complete graphs. The problem when all the 
nodes have information to communicate has been considered 
by Aoyama and Shah [4] for arbitrary networks. They refer 
to this problem as the information dissemination problem and 
report bounds to the dissemination time based on the spectral 
properties of the underlying graph. We study the same problem 
and report uniform bounds to the dissemination time, which 
depend only on the number of nodes and not on the topology 
of the network. (See discussion in Section. [IV]for a comparison 
of results) 

The paper is organized as follows - The Section. HI] goes 
through notation and preliminaries, Section. [Til] provides the 
main result and Section. [1V] puts the result in the context of 
existing results on the problem. 

II. Preliminaries 

The network is represented by graph G = (V, E) where V 
represents the set of nodes and E represents the set of edges 
in the network. Let N be the number of nodes in the graph. 
The only restriction on the graph G is that we require it to be 
connected. 

A. Asynchronous Time model 

We consider an asynchronous time model where we assume 
there is a global clock that ticks according to a Poisson process 
with mean N. At each global clock tick, a node i in Q is picked 
uniformly at random for communication. Then the node i picks 



a neighbour j independently and uniformly at random. The 
probability of selecting the node j is denoted by Pjj = -K 
where d, is the degree of node i. The pair of nodes then 
exchange messages (described in the following). Thus, we 
consider here a push and pull based protocol. 

B. Data Transmission - Random Linear Network Coding 

The messages exchanged during the communication are 
linear network codewords (see [1]). Each message is a vector 
of r symbols with each symbol an element of a finite field ¥ q 
of size q. Let F q denote the vector space of messages. Let us 
denote the initial messages at each node by m, £ FJ. Assume 
that all the initial messages are linearly independent vectors 
in F£, At any time instant t, each node i has a set of coded 
messages 
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along with the set of coefficients {aik}, 1 < k < N. The set 
of coded messages at a node span a subspace of the vector 
space F£. For node i at a time t, we denote this subspace 
by S(i,t). When the node i is picked for communication, it 
randomly picks a vector from its subspace S(i,t) and transmits 
the generated vector along with its coefficients (with respect 
to the basis {m,k}^ =1 ) to node j. More precisely let, 
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Then node i transmits to node j the vector along with 
the set of coefficients (^J2 a ikPi 



E ¥ q for 1 < k < N. On 



receiving this information from i, the node j does likewise. It 
sends to node i a randomly generated vector in its span S(j, t) 
along with the coefficients. We note here that the overhead 
of communicating coefficients, which is N symbols in ¥ q is 
amortized by choosing r large enough. 

C. Notation 

We define the various quantities which appear in our result. 

Definition 1 (Stopping Time): The stopping time T is de- 
fined as 

T = min{dim(S'(i, t)) = N for all i e G} 

At time T, each node i has message space of dimension TV 
and hence can recover all the initial messages nik, 1 < k < N. 



Definition 2 (6 -information spreading time): The 5- 
information spreading time T$ is defined as the minimum 
time t such that 

Pr{dim(S*(i, t)) = N for alH G Q} > 1 - S 
We introduce the notion of the predecessor graph W(i, t) of 
node i at time t. This graph contains nodes whose messages 
potentially influence the messages received at node i by time 
t. The graph is constructed in levels - the first level contains 
the nodes which exchange messages directly with i, the second 
level contains nodes which exchange messages directly with 
the level one nodes and so on. A precise definition now follows 
(it is not required for understanding the proof and can be 
skipped on a first read.) 

Definition 3 (Predecessor Graph): Define W^(i,t) as the 
node i itself. Let VV'^(i,t) denote the set of nodes which 
have contacted or been contacted by i until the time t. The 
superscript (1) indicates that these are nodes at "level one". 
Let T^^) denote the most recent time (preceding t) at which 
the node I G (i, t) contacts i. We make similar definitions 
for W (D) (i, t) and (I) for D > 2, i.e., for level two nodes, 
level three nodes etc. in the following. For D > 2 define 

w {D \i,t)= |J w^(iM D - 1} (i)) 

and for k G W (D) (i,t), 

t (k) = max I Most recent time preceding 

when k is in contact with it. j 

We finally define the set of nodes in the predecessor graph 

W(i,t) as 

W(i,t)= |J W (D) M 

We now define quantities related to the predecessor graph 
which we use in the proof. Let lZ(i, t) be a set of all distinct 
nodes in W(i, t) and S(lZ(i, t)) denote the space spanned by 
the messages of all the nodes in 1Z(i, t). It holds that S(i, t) C 
S(K(i,t)). 

III. Main result 

Theorem 1: If q > A 5 , the expected stopping time E[T] 
for the information dissemination problem for any network 
satisfies 
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Sketch of Proof: We show that after 24A 2 log A global 
clock ticks (on an average), if nodes i and j have the same 
message space, then with high probability both the nodes have 
recovered all messages. We then show that the probability of 
dimension increase beyond this time is lower bound by the 
fraction of nodes with message space dimension less than 
N. The expected number of global clock ticks to completion 
beyond 24A 2 logA is upper bound by -^-5A 2 logA. 



Proof: Let p(t) be the probability of strict increase of the total 
dimension (J2i e g dim(S(i, t))) at clock tick t. 

^ E i E P ^ Pr (S(i, *) + t))(l - )) 



The above follows since the dimension strictly increases if two 
nodes with nonidentical subspaces exchange messages and at 
least one of the messages does not lie in S(i) D S(j). The 
latter probability is lower bound by the factor (1 — |). 

We provide an upper bound © to Pr(S(i,t) — S(j,i)). 
This is shown through the following two lemmas. In Lemma 
Q] we show that if S(i,t) = S(j,t) then it holds with high 
probability (provided that the field size is large enough) that 
S(i,t) = S(lZ(i,t)), i.e., the node i can decode messages of 
all the nodes in its predecessor graph. 

Let us define the event A = C\ ue n(i t){S{j>t) 3 m u }. In 
words, A denotes the event that the message of every node in 
1Z(i,t) is contained in S(j,t). 

Lemma 1: 

Pr(5(i, t) = S(j, t)) < Pr(5(i, t) = S(j, t),A) + — 

q 

The proof is in the Appendix. The second lemma uses results 
from [5] which show that for t large enough, the predecessor 
graph of node i contains all nodes in Q with high probability. 
Combined with the previous lemma, this implies that for such 
a t if S(i,t) — S(j,t), then node i can decode w.h.p the 
messages of all the nodes of Q. Let ^(t) be the fraction of 
nodes in the graph which have decoded all the initial messages 
after t number of clock ticks. 
Lemma 2: For t > 24 A 2 log A, 

Pv(S(i,t) = S(j,t),A) <^(t) + ^ 

where j(t) is the fraction of nodes i such that dim(5(i, i)) = 
n. 

The proof is in the Appendix. From Lemmas Q] and |2] it follows 
that for t > 24A 2 logA 
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Pr(S(i, t) = S(j, t)) < 7 (t) + — + — (3) 

Let Tfc denote the number of clock ticks till the total 
dimension increases by k(k — 1). Clearly, tm corresponds 
to the number of clock ticks when all nodes can decode all 
the messages. Let t G (r m ,r m +i]. For all such t we have 
l(t) < 7(r m +i). Since the dimension increase till r m +i is, 
by definition, given by (m + l)m, we have 



j(t) < 7(r m+ i) 
Therefore 
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Pv(S(i,t) = S{j,t)) < 
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TABLE I 

Comparison of bounds: The first two columns are our bounds, 
the next two are the conductance bounds of [4]. 
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where the last inequality follows by choosing q > N 5 . 
Substituting (0]l in © we have that for t 6 (r m , r m+ i], 



P(t) > 1 - 
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where x" 1 corresponds to the number of clock ticks required to 
increase the total dimension by 1 in the interval [r m ,T m+ i). 
In the Appendix we compute E[tjv], under the assumption 
that p(t) is lower bound as in (0 (which is true for t > 
24iV 2 log iV ), and show it is upper bound by -^5N 2 log N. 
This implies that E[T tlcfes ] < 24iV 2 log N + ^j5N 2 log N < 
-^j29N 2 log N, where T t i C ks is the expected stopping time 
in terms of number of clock ticks. Let Zi denote the random 
time between the i — 1 th and the i th global clock ticks. The 
Zi's are i.i.d exponentials of rate N. It follows that 
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Corollary 1: The ^-information spreading time is given by 

T s = -^-58iVlog^log 27 
q — 1 o 
By Markov's inequality, it follows that 

q 



Pr(T > 



587V log N) < - 



Over log 2 | epochs, each of time ^^-58^ log N, it follows 
that the probability of failure of information dissemination 
is bounded to within 5. If S 

0{N log 2 N). 



-L, then we have Ti 
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IV. Discussion 

Deb et. al [3] considered the information dissemination 
problem (where there are, in general, k < N messages 
to be disseminated) on a complete graph. They show that 




Fig. 1. Lolli-pop network 



the -i- -information spreading time 7j_ is Q(N). This is a 
stronger result than what we obtain, which is 0(N log 2 N). 
However, their analysis is specific to complete graphs and is 
not extensible to arbitrary graphs. 

In [4], the authors compute the expected stopping time E[T] 
and the -^-information spreading time Ti for a pull based 
protocol and show it to be bounded as O(^) where 
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The quantity $p is called the conductance of the network. 
Thus their result is E[T] < 0(f), Ti < €>{§). We refer 
to this bound as the conductance bound. For a complete 
graph and a ring graph, the conductance bound computes to 
O(iVlogiV) and 0{N 2 ) respectively [4]. We compute the 
conductance bound for yet another class of graphs. 

Consider a Lolli-pop network (see Figure. 1), which consists 
of a complete graph on -j nodes, denoted by Kn_, to which 

9 AT ^ 

is attached a string, denoted by C, of =j- nodes. 
For this graph and for any k it holds that 

The above holds since for any k, the set S with minimum 
conductance can be constructed so that there is only one 
outgoing edge. Therefore 

N-l , 2V-1 
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= 0(N 6 ) 

Thus for the lollipop graph, the conductance bound is 0(N 2 ), 
In contrast to conductance bounds which vary depending 
on the network, our result states that the expected stopping 
time T and the -4 -information dissemination time Ti for 
any network is bounded as 0(N log N) and (9(A^log N) 
respectively. A comparison of the bounds with [4] is illustrated 
in Table. J] Our bounds for E[T] scales the same or better than 
the conductance bound. Our bounds for T±. scales worse for 

N 

the complete graph, but scales better for the ring and lollipop 
graphs as compared to the conductance bound. We note here 



that worsening of our bound for the complete graph is because 
we are not able to prove a concentration style result (which 
says that Ti is of the same order as E[T]), and hence incur 
the penalty of log N. 
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Appendix 

A. Proof of Lemma Q] 

Recall that A = fU^;,*)^ (•?>*) 2 m 4- 

Pr(5(i, t) = S(j, t)) = Pr(S(i, t) = S(j, t), A) 

+ Pr(S(i,t) = S(j,t),A c ) (6) 

where A c is the complement event. In the following we 
show that 

Pr(S(i,t) = S(j,t),A c ) 
is small if the field size is large. We have, 

Px(S(i, t) = S(j, t),A c ) = Pr (s(i, t) = S(j, t), 

< Pt{S(i,t) = S(j,t),S(j,t)3>m u ) 

< P*{S(i,t) = S(J,t)\S(j,t)l>m u ) (7) 

u£K(i,t) 

The message percolates from the node u to i along possibly 
many paths in the predecessor graph (enumerated by all 
possible paths from u to i in the predecessor graph). Consider 
any such path and remove the cycles in the path. Now along 
the path every node is distinct. 

Let u = lo,h, ■ • . ,lk = i denote the distinct vertices of the 
path and let /i^ denote the message sent from to U for 
1 < i < k. Also define B = S(j, t) ^ m u . We have 

Pr(S(i,t) = S(j,t)\B) < Pr(/i fe G S(j,t)\B) 
= Pv( t i k &S(j,t),^ 1 ^S(j,t)\B) 
+ Pr(fi k g S(j,t),ii k -i G S(j,t)\B) 

< Pr(/i fc G S(j,t)\^! i S(j, i),B)+Pr(/i fc _i G S(j,t)\B) 
Recursively using the above relation, we get 

Pi(S(i,t) = S(j,t)\B) < 
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^Pr( Mb (t) G i S(j,t) t B) 
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(a) h AT 

+ Pr(p 1 (t) € S(j,t)\B) < -<- (8) 

q q 

The inequality (a) follows since Pr(/^b(i) G 
S{j,t)\^b-i{i) ^ S(j,t),B) implies that the node lb-i 
weights the message /ib_i(i) with a specific symbol in ¥ q . 
(otherwise /ib_i(i) would be in S(j,t)) The probability of 
this event is upper bound by ~. The last inequality follows 
since the path has distinct nodes and hence the length is 
bounded by N. 

Substituting ([8]l in (0 gives 

Pr(S(i,t) = S(j,t),A c )< V -<— (9) 

q q 

Substituting in (O, we get 

Pr(S(t,f) - S(j,t)) < Pr(S(i,*) = S(j,t),A) + —. 
B. Proof of Lemma [2] 

RecaU that A = Cluen^iSij^) 2 m «l- We have ' 

Pr (S(i,t) = S(j,t), A) 

= Pr (s(i, t) = S(J, t), A, dim(5(i, t)) = n\ 

+ Pr (S(i, t) = S(j, t),A, dim(5(i, t)) < n^j 

< Pr (dam(S(i,t)) = nj 

+ Pr (S(i, t) = S(j, t),A, dim{S(i, t)) < nj 

= 7(t) + Pr (S(i, t) = S(j, t), A, dim{S(i, t)) < nj (10) 

where we used the fact that at any time instant, a node is picked 
independently, uniformly at random from the entire network, 
hence Pr(dim(S(i,t)) = n) = j(t). 

Note that {S(i,t) = S(j,t),A} is the event that the 
message of every distinct node in the predecessor graph of 
node i is contained in S(i,t). Thus, the node i can decode 
the initial message of all the distinct nodes in the predecessor 
graph. But since dim(S(i,t)) < n, there must exist a node 
which is not present in the predecessor graph. Thus we have, 

Pr(s(j,t) = S(i,t),A,dim(S(i,t)) < nj 

< Pr(There exists a node u not present 
in the predecessor graph W(i, t j) 

( a ) x ^ 

< y, Pr(w has not branched to i within t) (11) 

The argument (a) follows from the union bound and invokes 
a new term - "branching" defined in [5]. In [5], the authors 
consider a randomized protocol for broadcasting a message 
over an arbitrary network under the synchronous model and 
compute an upper bound to the expected time by which a 



message originating at a node spreads all over the network 
with high probability. The meaning of the term branching is 
the following - A node u branches to i within time t if and 
only if it is present in the predecessor graph of i at time t. 
Note that the branching process itself is defined on the original 
graph Q and not on the predecessor graph. In the following, 
we modify the arguments in [5] to upper bound the probability 
in CTfll. 

Pr(w has not branched to i within t) 
< Pr(w has not branched to i within t 
along the shortest path from u to i) 

Let u = xq, x\ , . . . , x m = i denote the shortest path from u to 
i in Q. A vertex w not on the path cannot be connected to more 
than three vertices on the path. (Otherwise, the shortest path 
would be via w). Thus it follows that Yl™=o deg{%r) < 3A. 
Let t r ^ r+ i denote the time to transmit a message from x r 
to x r+ \. Clearly, E{t r ^ r+ i) = p N — = Ndeg(x r ). The 

expected time, E(t u .i) to transmit the message along the 
shortest path from u to i is therefore upper bound by 3A 2 . 
From the Markov inequality, it follows that Pr(i„ i > 6A 2 ) < 
i. Therefore, after 4 log A epochs of duration 6 A 2 each, 



Pr(i Uii > 24A 2 logA) < 2 
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If t > 24 A 2 log A, we continue dTT) as 



Pr(S(j,t) = S(i,t),A,dim(S(i,t)) < n) < VPr(t„ > t) 



use 



< E Pr (^ > 247V 2 log A) < ^3. 
Substituting the upper bound to the probability in ( [Tol l. 



(12) 



Pv[S(j,t) =S(i, t),A) < l(t) + j^ (13) 



C. Computation of E[tn] 



The computation is done in two parts. We first compute 
E[tat_!] and then E[tjv — Tzv-i]- We omit the prefactor 
in the computations and append it directly in the result. 
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<3A 2 lnA (14) 



In the above, (a) follows by bounding the summation (exclud- 
ing the term for m = A — 1) by the integral. 

We now compute the expected time from rjv-i to tm- Let 
us define Tj^_ 1 as the time in the interval (tat-i, tjv] when the 
cluster of nodes which have decoded all messages is of size 
m. Let mo be the size of the cluster at rjv-i. Thus m > mo. 

Since t > 24A 2 logA we use Lemma [Hand Lemma [2] and 
thus for t G [t^j x , t^ 7 ^ 1 -*) we have 

2 m 2 

Pr(S(i, t) = *)) < 7(*) + < jj + (15) 
Thus probability of dimension increase in the interval 
i T j^v T N^i^'l ^ s l° wer bound by 



, . / m 2 , 
p{t) > (l - — — -^3) 



(16) 



Since the interval (tn-i>tn] contains A(A — 1) — (A — 
1)(A — 2) = 2(A — 1) dimension increments, it follows that 
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Summing the above over m going from mo to A — 1, we get 

N-l 

E[t n -t»-1]<(2N-2)J2 j —L z 

m V N JVV 
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-» (JV- m -4r) 

10 V JS J 

<2A 2 lnA (17) 

Combining (TT4b and (fTTT i we get 

EfrArl < — — 5A 2 lnA. 
9 — 1 



